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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

<i) APPLICANT: Trustees of the University of Pennsylvania 
Wilson, James M. 
Fisher, Krishna J. 

(ii) TITLE OF INVENTION: Method for Recombinant Adeno-Associated 
Virus-Directed Gene Therapy 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howson and Howson 

(B) STREET: Spring House Corporate Cntr, PO Box **/ 

(C) CITY: Spring House 

( D ) STATE : Pennsylvania 

(E) COUNTRY: USA 

(F) ZIP: 19477 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi> CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO 

( B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/708,188 

(B) FILING DATE: 06-SEP-1996 

(vii) PRIOR APPLICATION DATA: rtQ ,„ OQ nfi1 

(A) APPLICATION NUMBER: US 08/729,061 

(B) FILING DATE: 10-OCT-1996 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Kodroff , Cathy A. 

(B) REGISTRATION NUMBER: 33,980 „ Mn ~m 
(C> REFERENCE /DOCKET NUMBER: GNVPN. 019CIP2PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-540-9200 

(B) TELEFAX: 215-540-5818 



(2) INFORMATION FOR SEQ ID NO:l: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10398 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GAATTCGCTA GCATCATCAA TAATATACCT TATTTTGGAT TGAAGCCAAT ATGATAATGA 
GGGGGTGGAG TTTGTGACGT GGCGCGGGGC GTGGGAACGG GGCGGGTGAC GTAGTAGTGT 
GGCGGAAGTG TGATGTTG C A AGTGTGGCGG AACACATGTA AGCGACGGAT GTGGCAAAAG 
TGACGTTTTT GGTGTGCGCC GGTGTACACA GGAAGTGACA ATTTTCGCGC GGTTTTAGGC 
GGATGTTGTA GTAAATTTGG GCGTAACCGA GTAAGATTTG GCCATTTTCG CGGGAAAACT 
GAATAAGAGG AAGTGAAATC TGAATAATTT TGTGTTACTC ATAGCGCGTA ATATTTGTCT 
AGGGAGATCT GCTGCGCGCT CGCTCGCTCA CTGAGGCCGC CCGGGCAAAG CCCGGGCGTC 
GGGCGACCTT TGGTCGCCCG GCCTCAGTGA GCGAGCGAGC GCGCAGAGAG GGAGTGGCCA 
ACTCCATCAC TAGGGGTTCC TTGTAGTTAA TGATTAACCC GCCATGCTAC TTATCTACAA 
TTCGAGCTTG CATGCCTGCA GGTCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCXGA 
CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 
ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 
GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 
CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 
T ACGT AT TAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 
TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 
ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC TCGTTTAGTG 
AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG AAGACACCGG 
GACCGATCCA GCCTCCGGAC TCTAGAGGAT CCGGTACTCG AGGAACTGAA AAACCAGAAA 
GTTAACTGGT AAGTTTAGTC TTTTTGTCTT TTATTTCAGG TCCCGGATCC GGTGGTGGTG 
CAAATCAAAG AACTGCTCCT CAGTGGATGT TGCCTTTACT TCTAGGCCTG TACGGAAGTG 
TTACTTCTGC TCTAAAAGCT GCGGAATTGT ACCCGCGGCC GCAATTCCCG GGGATCGAAA 
GAGCCTGCTA AAGCAAAAAA GAAGTCACCA TGTCGTTTAC TTTGACCAAC AAGAACGTGA 
TTTTCGTTGC CGGTCTGGGA GGCATTGGTC TGGACACCAG CAAGGAGCTG CTCAAGCGCG 
ATCCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG CGTTACCCAA CTTAATCGCC 
TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC ACCGATCGCC 
CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCTT TGCCTGGTTT CCGGCACCAG 
AAGCGGTGCC GGAAAGCTGG CTGGAGTGCG ATCTTCCTGA GGCCGATACT GTCGTCGTCC 
CCTCAAACTG GCAGATGCAC GGTTACGATG CGCCCATCTA CACCAACGTA ACCTATCCCA 
TTACGGTCAA TCCGCCGTTT GTTCCCACGG AGAATCCGAC GGGTTGTTAC TCGCTCACAT 
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TTAATGTTGA 


TGAAAGCTGG 


CTACAGGAAG 


GCCAGACGCG 


AATTATTTTT 


GATGGCGTTA 


1920 


ACTCGGCGTT 


TCATCTGTGG 


TGCAACGGGC 


GCTGGGTCGG 


TTACGGCCAG 


GACAGTCGTT 


1980 


TGCCGTCTGA 


ATTTGACCTG 


AGCGCATTTT 


TACGCGCCGG 


AGAAAACCGC 


CTCGCGGTGA 


2040 


TGGTGCTGCG 


TTGGAGTGAC 


GGCAGTTATC 


TGGAAGATCA 


GGATATGTGG 


CGGATGAGCG 


2100 


GCATTTTCCG 


TGACGTCTCG 


TTGCTGCATA 


AACCGACTAC 


ACAAATCAGC 


GATTTCCATG 


2160 


TTGCCACTCG 


CTTTAATGAT 


GATTTCAGCC 


GCGCTGTACT 


GGAGGCTGAA 


GTTCAGATGT 


2220 


GCGGCGAGTT 


GCGTGACTAC 


CTACGGGTAA 


CAGXTTCTTT 


ATGGCAGGGT 


GAAACGCAGG 


2280 


TCGCCAGCGG 


CACCGCGCCT 


TTCGGCGGTG 


AAATTATCGA 


TGAGCGTGGT 


GGTTATGCCG 


2340 


ATCGCGTCAC 


ACTACGTCTG 


AACGTCGAAA 


ACCCGAAACT 


GTGGAGCGCC 


GAAATCCCGA 


2400 


ATCTCTATCG 


TGCGGTGGTT 


GAACTGCACA 


CCGCCGACGG 


CACGCTGATT 


GAAGCAGAAG 


2460 


CCTGCGATGT 


CGGTTTCCGC 


GAGGTGCGGA 


TTGAAAATGG 


TCTGCTGCTG 


CTGAACGGCA 


2520 


AGCCGTTGCT 


GATTCGAGGC 


GTTAACCGTC 


ACGAGCATCA 


TCCTCTG CAT 


GGTCAGGTCA 


2580 


TGGATGAGCA 


GACGATGGTG 


CAGGATATCC 


TGCTGATGAA 


GCAGAACAAC 


TTTAACGCCG 


2640 


TGCGCTGTTC 


GCATTATCCG 


AACCATCCGC 


TGTGGTACAC 


GCTGTGCGAC 


CGCTACGGCC 


2700 


TGTATGTGGT 


GGATGAAGCC 


AATATTGAAA 


CCCACGGCAT 


GGTGCCAATG 


AATCGTCTGA 


2760 


CCGATGATCC 


GCGCTGGCTA 


CCGGCGATGA 


GCGAACGCGT 


AACGCGAATG 


GTGCAGCGCG 


2820 


ATCGTAATCA 


CCCGAGTGTG 


AT CATCTGGT 


CGCXGGGGAA 


TGAATCAGGC 


CACGGCGCTA 


2880 


ATCACGACGC 


GCTGTATCGC 


TGGATCAAAT 


CTGTCGATCC 


TTCCCGCCCG 


GTG CAGTATG 


2940 


AAGGCGGCGG 


AGCCGACACC 


ACGGCCACCG 


ATATTATTTG 


CCCGATGTAC 


GCGCGCGTGG 


3000 


ATGAAGACCA 


GCCCTTCCCG 


GCTGTGCCGA 


AATGGTCCAT 


CAAAAAATGG 


CTTTCGCTAC 


3060 


CTGGAGAGAC 


GCGCCCGCTG 


ATCCTTTGCG 


AATACGCCCA 


CGCGATGGGT 


AACAGTCTTG 


3120 


GCGGTTTCGC 


TAAATACTGG 


CAGGCGTTTC 


GTCAGTATCC 


CCGTTTACAG 


GGCGGCTTCG 


3180 


TCTGGGACTG 


GGTGGATCAG 


TCGCTGATTA 


AATATGATGA 


AAACGGGAAC 


CCGTGGTCGG 


3240 


CTTACGGCGG 


TGATTTTGGC 


GATACGCCGA 


ACGATCGCCA 


GTTCTGTATG 


AACGGTCTGG 


3300 


TCTTTGCCGA 


CCGCACGCCG 


CATCCAGCGC 


TGACGGAAGC 


AAAACACCAG 


GAG C AGTTTT 


3360 


TCCAGTTCCG 


TTTATCCGGG 


CAAACCAT CG 


AAGTGACCAG 


CGAATACCTG 


TTCCGTCATA 


3420 


GCGATAACGA 


GCTCCTGCAC 


TGGATGGTGG 


CGCTGGATGG 


TAAGCCGCTG 


GCAAGCGGTG 


3480 


AAGTGCCTCT 


GGATGTCGCT 


CCACAAGGTA 


AACAGTTGAT 


TGAACTGCCT 


GAACTACCGC 


3540 


AGCCGGAGAG 


CGCCGGGCAA 


CTCTGGCTCA 


CAGTACGCGT 


AGTGCAACCG 


AACGCGACCG 


3600 


CATGGTCAGA 


AGCCGGGCAC 


ATCAGCGCCT 


GGCAGCAGTG 


GCGTCTGGCG 


GAAAACCTCA 


3660 


GTGTGACGCT 


CCCCGCCGCG 


TCCCACGCCA 


TCCCGCATCT 


GACCACCAGC 


GAAATGGATT 


3720 


TTTGCATCGA 


GCTGGGTAAT 


AAG CGTTGG C 


AATTTAACCG 


CCAGTCAGGC 


TTTCTTTCAC 


3780 



WO 98/09657 



PCT/US97/15692 



AGATGTGGAT 
CACCGCTGGA 
TCGAACGCTG 
CAGATACACT 
AAACCTXATX 
CCGTTGATGT 
AGCTGGCGCA 
CCGACCGCCT 
CCCCGTACGT 
GCCCACACCA 
TGATGGAAAC 
ACGGXXTCCA 
TACAGCXGAG 
CGGGCAGGCC 
ACACAAACXT 
ACTTCCCGTT 
GTATTATTTT 
TTTCTGACAA 
ATTGAXGAGT 
ATTTGTGATG 
AACAATTGCA 
CCTCTAGAGT 
AGTGATGGAG 
AAAGGTCGCC 
CAGATCTGGA 
GGCGGTAAAC 
GATCACTTGG 
TGAGGTACTG 
CTTATGTAGT 
GATGGAAGCA 
CAGAATGTGA 
XTGACCXACG 



TGGCGATAAA 
TAACGACATT 
GAAGGCGGCG 
TGCXGAXGCG 
TATCAGCCGG 
TGAAGTGGCG 
GGT AG C AG AG 
TACTGCCGCC 
CXXCCCGAGC 
GTGGCGCGGC 
CAGCCATCGC 
XAXGGGGAXX 
CGCCGGTCGC 
ATGTCXGCCC 
TXGGAXGXXC 
TTTCCCGAXT 
TGCCGCXATT 
ACTCGGCCTC 
TTGGACAAAC 
CTAXXGCXXX 
XXCAXXXXAX 
CGAGXAGAXA 
XXGGCCACXC 
CGACGCCCGG 
AGGTG CTG AG 
ATATTAGGAA 
XGCTGGCCXG 
AAATGTGTGG 
TXXGTATCTG 
TTGTGAGCTC 
TGGGCXCCAG 
AGACCGXGXC 



AAACAACXGC 
GGCGXAAGXG 
GGCCAXTACC 
GTGCTGATTA 
AAAACCXACC 
AGCGAXACAC 
CGGGXAAACX 
XGXXXXGACC 
GAAAACGGXC 
GACXXCCAGX 
CAXCXGCXGC 
GGXGGCGACG 
XACCAXXACC 
GXAXXXCGCG 
GGXXXAXXCX 
XGGCXACAXG 
XCXCXGXXCX 
GACXCXAGGC 
CACAACXAGA 
AXXXGXAACC 
GXXXCAGGXX 
AGXAGCAXGG 
CCXCXCXGCG 
GCXXXGCCCG 
GXACGATGAG 
CCAGCCXGTG 
CACCCGCGCT 
GCGXGGCTXA 
TTTXGCAGCA 
AT ATT T G ACA 
CATTGATGGT 
TGGAACGCCG 
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TGACGCCGCT 
AAGCGACCCG 
AGGCCGAAGC 
CGACCGCTCA 
GGAXTGATGG 
CGCATCCGGC 
GGCTCGGATT 
GCTGGGATCT 
TGCGCTGCGG 
XGAACATCAG 
ACGCGGAAGA 
ACTCCXGGAG 
AGXXGGXCXG 
XAAGGAAATC 
XXXXCXTXXA 
ACAXCAACCA 
CGCXAXTAXX 
GGCCGCGGGG 
ATGCAGXGAA 
AXXAXAAGCX 
CAGGGGGAGG 
CGGGXXAATC 
CGCXCGCXCG 
GGCGGCCXGA 
ACCCGCACCA 
ATGCTGGATG 
GAGTTXGGCT 
AGGGTGGGAA 
GCCGCCGCCG 
ACGCGCATGC 
CGCCCCGTCC 
TTGGAGACTG 



GCGCGATCAG 
CATTGACCCT 
AGCGTTGXXG 
CGCGTGGCAG 
TAGTGGXCAA 
GCGGAXTGGC 
AGGGCCGCAA 
GCCAXTGXCA 
GACGCGCGAA 
CCGCXACAGX 
AGGCACATGG 
CCCGXCAGXA 
GXGXCAAAAA 
CATTATGTAC 
CXXTXXTAXC 
XAXCAGCAAA 
CCAACCGCXG 
AXCCAGACAT 
AAAAAXGCXX 
GCAAXAAACA 
XGTGGGAGGT 
ATXAACXACA 
CXCACXGAGG 
GXGAGCGAGC 
GGTGCAGACC 
TGACCGAGGA 
CTAGCGATGA 
AGAAXATATA 
CCATGAGCAC 
CCCCATGGGC 
TGCCCGCAAA 
CAGCCTCCGC 



TTCACCCGXG 
AACGCCTGGG 
CAGXGCACGG 
CAXCAGGGGA 
ATGGCGAXTA 
CXGAACXGCC 
GAAAACXATC 
GACATGTATA 
XXGAATTATG 
CAACAGCAAC 
CTGAAXAXCG 
XCGGCGGAAX 
XAATAAXAAC 
XAXXTAAAAA 
AXGGGAGCCX 
AGXGAXACGG 
XXXGGTCTGC 
GAXAAGAXAC 
XAXTXGXGAA 
AGXXAACAAC 
XXXTTCGGAT 
AGGAACCCCX 
CCGGGCGACC 
GAGCGCGCAG 
CXGCGAGTGX 
GCXGAGGCCC 
AGATACAGAT 
AGGTGGGGGT 
CAACTCGTTT 
CGGGGTGCGT 
CTCTACTACC 
CGCCGCTTCA 
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GCCGCTGCAG 


CCACCGCCCG 


CGGGATTGTG 


ACTGACTTTG 


CTTTCCTGAG 


CCCGCTTGCA 


5760 


AGCAGTGCAG 


CTTCCCGTTC 


ATCCGCCCGC 


GATGACAAGX 


TGACGGCTCT 


TTTGGCACAA 


5820 


TTGGATTCTT 


TGACCCGGGA 


ACTTAATGTC 


GTTTCTCAGC 


AGCTGTTGGA 


TCTGCGCCAG 


5880 


CAGGTTTCTG 


CCCTGAAGGC 


TTCCTCCCCT 


CCCAATGCGG 


TTTAAAACAT 


AAATAAAAAA 


5940 


CCAGACTCTG 


TTTGGATTTG 


GATCAAGCAA 


GTGTCTTGCT 


GTCTTTATTT 


AGGGGTTTTO 


6000 


CGCGCGCGGT 


AGGCCCGGGA 


CCAGCGGTCT 


CGGTCGTTGA 


GGGTCCTGTG 


TATTTTTTCC 


6060 


AGGACGTGGT 


AAAGGTGACT 


CTGGATGTTC 


AGATACATGG 


GCATAAGCCC 


GTCTCTGGGG 


6120 


TGGAGGTAGC 


ACCACTGCAG 


AGCTTCATGC 


TGCGGGGTGG 


TGTTGTAGAT 


GATCCAGTCG 


6180 


TAGCAGGAGC 


GCTGGGCGTG 


GTGCCTAAAA 


ATGTCTTTCA 


GTAGCAAGCT 


GATTGCCAGG 


6240 


GGCAGGCCCT 


TGGTGTAAGT 


GTTTACAAAG 


CGGTTAAGCT 


GGGATGGGTG 


CATACGTGGG 


6300 


GATATGAGAT 


GCATCTTGGA 


CTGTATTTTT 


AGGTTGGCTA 


TGTTCCCAGC 


CATATCCCTC 


6360 


CGGGGATTCA 


TGTTGTGCAG 


AACCACCAGC 


ACAGTGTATC 


CGGTGCACTT 


GGGAAATTTG 


6420 


TCATGTAGCT 


TAGAAGGAAA 


TGCGTGGAAG 


AACTTGGAGA 


CGCCCTTGTG 


ACCTCCAAGA 


6480 


TTTTCCATGC 


ATTCGTCCAT 


AATGATGGCA 


ATGGGCCCAC 


GGGCGGCGGC 


CTGGGCGAAG 


6540 


ATATTTCTGG 


GATCACTAAC 


GTCATAGTTG 


TGTTCCAGGA 


TGAGATCGTC 


ATAGGCCATT 


6600 


TTTACAAAGC 


GCGGGCGGAG 


GGTGCCAGAC 


TGCGGTATAA 


TGGTTCCATC 


CGGCCCAGGG 


6660 


GCGTAGTTAC 


CCTCACAGAT 


XTGCATTTCC 


CACGCTTTGA 


GTTCAGATGG 


GGGGATCATG 


6720 


TCTACCTGCG 


GGGCGATGAA 


GAAAACGGTT 


TCCGGGGTAG 


GGGAGATCAG 


CTGGGAAGAA 


6780 


AGCAGGTTCC. 


TGAGCAGCTG 


CGACTTACCG 


CAGCCGGTGG 


GCCCGTAAAT 


CACACCTATT 


6840 


ACCGGGTGCA 


ACTGGTAGTT 


AAGAGAGCTG 


CAGCTGCCGT 


CATCCCTGAG 


CAGGGGGGCC 


6900 


ACTTCGTTAA 


GCATGTCCCT 


GACTCGCATG 


TTTTCCCTGA 


CCAAATCCGC 


CAGAAGGCGC 


6960 


TCGCCGCCCA 


GCGATAGCAG 


TTCTTGCAAG 


GAAGCAAAGT 


TTTTCAACGG 


TTTGAGACCG 


7020 


TCCGCCGTAG 


GCATGCTTTT 


GAGCGTTTGA 


CCAAGCAGTT 


CCAGGCGGTC 


CCACAGCTCG 


7080 


GTCACCTGCT 


CTACGG CATC 


TCGATCCAGC 


ATATCTCCTC 


GTTTCGCGGG 


TTGGGGCGGC 


7140 


TTTCGCTGTA 


CGGCAGTAGT 


CGGTG CTCGT 


CCAGACGGGC 


CAGGGTCATG 


TCTTTCCACG 


7200 


GGCGCAGGGT 


CCTCGTCAGC 


GTAGXCTGGG 


TCACGGTGAA 


GGGGTGCGCT 


CCGGGCTGCG 


7260 


CGCTGGCCAG 


GGTGCGCTTG 


AGGCTGGTCC 


TGCTGGTGCT 


GAAGCGCTGC 


CGGTCTTCGC 


7320 


CCTGCGCGTC 


GGCCAGGTAG 


CATTTGACCA 


TGGTGTCATA 


GTCCAGCCCC 


TCCGCGGCGT 


7380 


GGCCCTTGGC 


GCGCAGCTTG 


CCCTTGGAGG 


AGGCGCCGCA 


CGAGGGGCAG 


TGCAGACTTT 


7440 


TGAGGGCGTA 


GAG CTTGGGC 


GCGAGAAATA 


CCGATTCCGG 


GGAGTAGGCA 


TCCGCGCCGC 


7500 


AGGCCCCGCA 


GACGGTCTCG 


CATTCCACGA 


GCCAGGTGAG 


CTCTGGCCGT 


TCGGGGTCAA 


7560 


AAACCAGGTT 


TCCCCCATGC 


TTTTTGATGC 


GTTTCTTACC 


TCTGGTTTCC 


ATG AG CCGGT 


7620 
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GTCCACGCTC 
CCTCGACCGA 
ATGACTATCG 
CCGGCAGCGC 
GGCCTGTCGC 
CCCGCCACCA 
CTGGGCTACG 
CTTCTCGCTT 
GATGACGACC 
ATCACTGGAC 
TTGGCATGGA 
GCATGGAGCC 
CCACTCCAAG 
CTTGGCAGAA 
GCAGCGTTGG 
TAGGCTGGCG 
GAAGCGACTG 
TCCGTGTTTC 
TCTGCATCGC 
CTTTCTCAAT 
GGCTGTGTGC 
CTTGAGTCCA 
ATTAGCAGAG 
GGCTACACTA 
AAAAGAGTTG 
GTTTGCAAGC 
TCTACGGGGT 
TTATCAAAAA 
TAAAGTATAT 
ATCTCAGCGA 
ACTACGATAC 
CGCTCACCGG 



GGTGACGAAA 
TGCCCTTGAG 
TCGCCGCACT 
TCTGGGTCAT 
TTGCGGTATT 
AACGTTTCGG 
TCTTGCTGGC 
CCGGCGGCAT 
ATCAGGGACA 
CGCTCATCGT 
TTGTAGGCGC 
GGGCCACCTC 
AATTGGAGCC 
CATATCCATC 
GTCCTGGCCA 
GGGTTGCCTT 
CTGCTGCAAA 
GTAAAGTCTG 
AGGATGCTGC 
GCTCACGCTG 
ACGAACCCCC 
ACCCGGTAAG 
CGAGGTATGT 
GAAGGACAGT 
GTAGCTCTTG 
AG C AG ATT AC 
CTGACGCTCA 
GGATCTTCAC 
ATGAGTAAAC 
TCTGTCTATT 
GGGAGGGCTT 
CTCCAGATTT 



AGG CTGTCCG 
AGCCTTCAAC 
TATGACTGTC 
TTTCGGCGAG 
CGGAATCTTG 
CGAGAAGCAG 
GTTCGCGACG 
CGGGATGCCC 
GCTTCAAGGA 
CACGGCGATT 
CGCCCTATAC 
GACCTGAATG 
AATCAATTCT 
GCGTCCGCCA 
CGGGTGCGCA 
ACTGGTTAGC 
ACGTCTGCGA 
GAAACGCGGA 
TGGCTACCCT 
TAG GT ATCTC 
CGTTCAGCCC 
ACACGACTTA 
AGGCGGTGCT 
ATTTGGTATC 
ATCCGGCAAA 
GCGCAGAAAA 
GTGGAACGAA 
CTAGATCCTT 
TTGGTCTGAC 
TCGTTCATCC 
ACCATCTGGC 
ATC AG CAATA 
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TGTCCCCGTA 
CCAGTCAGCT 
TTCTTTATCA 
GACCGCTTTC 
CACGCCCTCG 
GCCATTATCG 
CGAGGCTGGA 
GCGTTGCAGG 
TCGCTCGCGG 
TATGCCGCCT 
CTTGTCTGCC 
GAAGCCGGCG 
TGCGGAGAAC 
TCTCCAG CAG 
TGATCGTGCT 
AGAATGAATC 
CCTGAGCAAC 
AGTCAGCGCC 
GTGGAACACC 
AGTTCGGTGT 
GACCGCTGCG 
TCGCCACTGG 
ACAGAGTTCT 
TGCGCTCTGC 
CAAACCACCG 
AAAGGATCTC 
AACTCACGTT 
TTAAATTAAA 
AGTTACCAAT 
ATAGTTGCCT 
CCCAGTGCTG 
AACCAGCCAG 



TACAGACTTG 
CCTTCCGGTG 
TGCAACTCGT 
GCTGGAGCGC 
CTCAAGCCTT 
CCGGCATGGC 
TGGCCTTCCC 
CCATGCTGTC 
CTCTTACCAG 
CGGCGAGCAC 
TCCCCGCGTT 
GCACCTCGCT 
TGTGAATGCG 
CCGCACGCGG 
CCTGTCGTTG 
ACCGATACGC 
AACATGAATG 
CTGCACCATT 
TACATCTGTA 
AGGTCGTTCG 
CCTTATCCGG 
CAGCAGCCAC 
TGAAGTGGTG 
TGAAGCCAGT 
CTGGTAG CGG 
AAGAAGATCC 
AAGGGATTTT 
AATGAAGTTT 
GCTTAATCAG 
GACTCCCCGT 
CAATGATACC 
CCGGAAGGGC 



AGAGGCCTGT 
GGCGCGGGGC 
AGGACAGGTG 
GACGATGATC 
CGTCACTGGT 
GGCCGACGCG 
CATTATGATT 
CAGGCAGGTA 
CCTAACTTCG 
ATGGAACGGG 
GCGTCGCGGT 
AACGGATTCA 
CAAACCAACC 
CGCATCTCGG 
AGGACCCGGC 
GAG CG AACGT 
GTCTTCGGTT 
ATGTTCCGGA 
TTAACGAAGC 
CTCCAAG CTG 
TAACTATCGT 
TGGTAACAGG 
GCCTAACTAC 
TACCTTCGGA 
TGGTTTTTTT 
TTTGATCTTT 
GGTCATGAGA 
TAAATCAATC 
TGAGGCACCT 
CGTGTAGATA 
GCGAGACCCA 
CG AG CG CAG A 



7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 
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8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 
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9540 



WO 98/09657 




PCT/US97/15692 



52 



AGTGGTCCTG 


CAACTTTATC 


CGCCTCCATC 


GAGTCTATTA 


ATTGTTGCCG 


GG AAGCTAGA 


9600 


GTAAGTAGTT 


CGCCAGTTAA 


TAGTTTGCGC 


AACGTTGTTG 


CCATTGCTGC 


AGGCATCGTG 


9660 


GTGTCACGCT 


CGTCGTTTGG 


TATGGCTTCA 


TTCAGCTCCG 


GTTCCCAACG 


ATCAAGGCGA 


9720 


GTTACATGAT 


CCCCCATGTT 


GTGCAAAAAA 


GCGGTTAGCT 


CCTTCGGTCC 


TCCGATCGTT 


9780 


GTCAGAAGTA 


AGTTGGCCGC 


AGTGTTATCA 


CTCATGGTTA 


TGGCAGCACT 


GCATAATTCT 


9840 


CTTACTGTCA 


TGCCATCCGT 


AAGATGCTTT 


TCTGTGACTG 


GTGAGTACTC 


AACCAAGTCA 


9900 


TTCTGAGAAT 


AGTGTATGCG 


GCGACCGAGT 


TGCTCTTGCC 


CGGCGTCAAC 


ACGGGATAAT 


9960 


ACCGCGCCAC 


ATAGCAGAAC 


TTTAAAAGTG 


CTCATCATTG 


GAAAACGTTC 


TTCGGGGCGA 


10020 


AAACTCTCAA 


GGATCTTACC 


GCTGXTGAGA 


TCCAGTTCGA 


TGTAACCCAC 


TCGTGCACCC 


10080 


AACTGATCTT 


CAGCATCTTT 


TACTTTCACC 


AGCGTTTCTG 


GGTGAGCAAA 


AACAGGAAGG 


10140 


CAAAATGCCG 


GAAAAAAGGG 


AATAAGGGCG 


ACACGGAAAT 


GTTGAATACT 


CATACTCTTC 


10200 


CTTTTTCAAT 


ATTATTGAAG 


CATTTATCAG 


GGTTATTGTC 


TCATGAGCGG 


ATACATATTT 


10260 


GAATGTATTT 


AGAAAAATAA 


AGAAATAGGG 


GTTCCGCGGA 


CATTTCCCCG 


AAAAGTGCGA 


10320 


CCTGACGTCT 


AAGAAACCAT 


TATTATCATG 


ACATTAACCT 


ATAAAAATAG 


GCGTATCACG 


10380 


AGGCCCTTTC 


GTCTTCAA 










10398 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CATGGTAATA GCGATGACTA 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GCTCTGCTTA TATAGACCTC 
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(2) INFORMATION FOR SEQ ID NO* 4* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
ATAAGCTGCA ATAAACAAGT 
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